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Potent Inhibition of HIV-1 Reverse Transcriptase 
and Replication by Nonpseudoknot, "UCAA-motif " 
RNA Aptamers 

Angela S Whatley^ ^ Mark A Ditzler^ ", Margaret J Lange\ Elisa BiondP ^ Andrew W Sawyer^ Jonathan L Chang\ Joshua D Franken^ 
and Donald H Burke^'^ 

RNA aptamers that bind the reverse transcriptase (RT) of human immunodeficiency virus (HIV) compete with nucleic acid 
primer/template for access to RT, inhibit RT enzymatic activity in vitro, and suppress viral replication when expressed in 
human cells. Numerous pseudoknot aptamers have been identified by sequence analysis, but relatively few have been 
confirmed experimentally. In this work, a screen of nearly 100 full-length and >60 truncated aptamer transcripts established 
the predictive value of the F1 Pk and F2Pk pseudoknot signature motifs. The screen also identified a new, nonpseudoknot motif 
with a conserved unpaired UCAA element. High-throughput sequence (HTS) analysis identified 181 clusters capable of forming 
this novel element. Comparative sequence analysis, enzymatic probing and RT inhibition by aptamer variants established the 
essential requirements of the motif, which include two conserved base pairs (AC/GU) on the 5' side of the unpaired UCAA. 
Aptamers in this family inhibit RT in primer extension assays with IC^^ values in the low nmol/l range, and they suppress viral 
replication with a potency that is comparable with that of previously studied aptamers. All three known anti-RT aptamer families 
(pseudoknots, the UCAA element, and the recently described "(6/5)AL" motif) are therefore suitable for developing aptamer- 
based antiviral gene therapies. 
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Introduction 

The reverse transcriptase (RT) from type 1 human immuno- 
deficiency virus (HIV-1) plays an essential role in viral replica- 
tion by copying the RNA genome into double-stranded DNA 
(dsDNA) before insertion of the DNA into the host genome. RT 
inhibition is a proven therapeutic strategy for clinical treatment 
of HIV infection, although limitations in drug tolerance and the 
selection of drug-resistant viral strains continue to motivate the 
search for new therapeutic approaches. Nucleic acid aptamers 
composed of RNA or single-stranded DNA (ssDNA) that bind 
RT have been identified through the SELEX process (Sys- 
tematic Evolution of Ligands by Exponential enrichment). ^"^ 
Many of these aptamers compete with the natural primer/ 
template for access to RT, inhibiting the DNA polymerization 
and RNaseH activities of RT at low nanomolar concentrations 
in enzymatic assays. In addition, several RNA aptamers 
strongly suppress viral replication when expressed in cells^""" 
and could potentially be adapted for hematopoietic stem cell 
gene therapy. These findings have prompted substantial inter- 
est in developing RNA aptamer inhibitors of HIV-1 . 

Structural diversity among nucleic acid aptamers that bind 
RT correlates with significant functional diversity. For exam- 
ple, pseudoknot (Pk) RNA aptamers have been loosely cat- 
egorized as being either "family 1" (FlPk) — defined primarily 



by the presence of UCCG/CGGG in stem 1 and first iden- 
tified by Tuerk^ — or "family 2" (F2Pk) — originally a catch-all 
classification for any potential pseudoknot not matching the 
FlPk definition." In one study,^^ two F1Pk and two F2Pk 
aptamers strongly inhibited RT from HIV strains in which 
position 277 was Arg, but only the F2Pk aptamers contin- 
ued to inhibit when position 277 was Lys, which is a common 
polymorphism among circulating strains of HIV-1 . In two other 
studies,'''" the ssDNA aptamers RT1t49, R1T and variants 
of these two strongly inhibited all the members of a phyloge- 
netically diverse panel of purified recombinant RT, whereas 
another ssDNA aptamer, RT8, was highly specific for RT from 
a subtype B strain of HIV-1. We recently used mass spec- 
trometry footprinting to establish that the broad-spectrum 
ssDNA aptamers protect essentially the same surfaces of 
RT as those protected by dsDNA, whereas a pseudoknot 
RNA aptamer (F1 Pk) protects a substantially smaller surface 
area.^** Aptamer diversity also has implications for diagnos- 
tic applications. Li et al. identified two nonpseudoknot RNA 
aptamers (M302 and 12.01) that discriminated between wild- 
type RT and a particular octamutated, drug-resistant RT. Both 
aptamers M302 and 12.01 bind RT with low nmol/l affinity, but 
neither one inhibits RT enzymatic activity or competes with 
pseudoknot aptamers or primer/template for binding to RT' 
Functional variations among aptamers of different structural 
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classes could also lead to differences in susceptibility to the 
emergence of de novo resistance mutations and to the differ- 
ences in potential off-target effects. It is therefore important to 
understand the diversity of structural motifs that can bind RT. 

RNA pseudoknots, especially the F1Pk, have long been 
recognized as high affinity ligands for HIV-1 RT, and they domi- 
nated the first three populations of RT-binding RNA aptamers 
that were described. ^'^'^^ Interestingly, no such convergence 
on pseudoknots occurred among RNA and ssDNA aptamers 
from other selections, such as RNA aptamers selected to bind 
RT from Moloney murine leukemia virus, feline immunodefi- 
ciency virus or avian myeloblastosis virus, ssDNA aptam- 
ers selected to bind HIV-1 RT, and RNA aptamers selected to 
differentiate between drug-resistant and wild-type HIV-1 RT.^ 
These observations together suggest that there are likely 
additional, nonpseudoknot structural motifs present in these 
populations. Indeed, by applying high-throughput sequencing 
(HTS) and a newly developed bioinformatics pipeline to the 
70HRT^^ population of HIV-1 RT aptamers," we recently iden- 
tified another structural element termed "(6/5)AL," in which an 
asymmetric internal loop with six nucleotides in one strand 
and five in the other is flanked by generic stems with different 
length requirements.^^ 

The 32N population from the first RT-aptamer selection^ and 
the subsequent 70HRT^^ and SOHRT,^ populations" were all 
originally selected to bind RT from HIV-1 strain BH10. Low- 
throughput sequence (LTS) analysis of these three populations 
identified 18, 46, and 44 nonidentical published sequences 
(108 total), respectively, from among 194 total reads (95, 
54, and 45, respectively, for the three populations). Potential 
pseudoknot-forming elements were identified within most of 
the sequences from all three selections. More than half (61 
of 108) contained the FlPk signature sequence (11, 31, and 
19 aptamer sequences, respectively for the three popula- 
tions). Alternative F2Pk lacking this signature sequence were 
proposed" for another 36 sequences (1 1 and 25 from popu- 
lations 70HRT,^ and 80HRT,^, respectively). A small handful 
of FlPk and relatively compact F2Pk have been confirmed 
experimentally,^^'^^'^^ but several of the manually assigned 
F2Pk have very large loops or very short stems that may be 
incompatible with pseudoknot formation, leaving open the pos- 
sibility that portions of those transcripts other than the putative 
pseudoknots may be responsible for RT-binding affinity. In addi- 
tion, nearly all of the sequences in the 70HRT^^ and 80HRT^^ 
populations were sampled only once, indicating that significant 
untapped sequence diversity remains within both populations. 

These observations raised two immediate questions: (i) 
whether the 30-50 nucleotide F1 Pk and F2Pk identified by 
sequence gazing represent the core RT-binding segments 
within the original 118-134 nucleotide transcripts, and (ii) 
whether additional RT-binding structures might be present 
within these populations. By screening nearly 100 full-length 
aptamers and >60 truncated variants, we established that the 
original F1 Pk definition is highly reliable in defining the RT- 
binding module within aptamers that contain this sequence, 
and that most but not all of the original F2Pk account for RT 
binding by those RNAs. Importantly, this work also identified 
several nonpseudoknot RNAs, including two aptamers that 
form similar secondary structures with a conserved UCAA 
internal bulge and that inhibit RT with IC^^ values below 10 



nmol/l. HTS analysis identified >150 independent examples 
of this structural element and, in conjunction with enzymatic 
digestion and mutational analysis, defined the sequence 
requirements for forming the RT-binding module. The UCAA 
aptamers reduce infectivity of virus produced in the presence 
of aptamer and display a potency that is at least comparable 
to RNA aptamers with other structural motifs. This new UCAA 
family of aptamers represents one of the few published exam- 
ples of nonpseudoknot RNA structures that inhibit RT, and 
illustrates the ability of structurally unrelated RNA aptamers 
to bind and inhibit the same protein target. 

Results 

Sequence and structural diversity within 70HRT^^ and 
801-IRT^^ aptamer populations 

Sixty additional aptamer plasmid sequences were obtained 
to augment the published LTS data set and gain new insights 
into 70HRT,^ and 80HRT„ aptamer population diversity" 
Thirty-five of these represent sequences that had not previ- 
ously been sampled, bringing the total published LTS data set 
to 143 independent sequences from 254 reads (Supplemen- 
tary Figure S1 and refs. [3,4]). Inhibition of primer extension 
by RT from HIV-1 subtype B strain HXB2 was evaluated for 
full-length aptamer transcripts from 98 plasmid isolates from 
the 70HRT^^ and 80HRT,^ populations (approximately 118 
and 134 nt, respectively), and these aptamers were grouped 
according to their relative potency (Figure 1, Supplemen- 
tary Figure S2 and data not shown). When small transcripts 
(26-48 nt) corresponding to putative pseudoknot cores were 
similarly tested, all of the F1 Pk cores (12 of 12) inhibited RT 
as potently as their corresponding full-length versions, con- 
firming that the FlPk signature sequence accurately identi- 
fies the RT-binding elements within the full-length 116-134 
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Figure 1 Inhibition of DNA polymerization by HIV-1 reverse 
transcriptase by RNA aptamers. Quantification of primer exten- 
sion assays showing fraction of primer converted into full-length 
product in the absence of RT ("MR"), control reactions without 
aptamer ("0"), and various full-length aptamers, as indicated be- 
low the graph. Plotted values and vertical error bars represent the 
mean and SD of the fraction extended to full length (normalized to 
the fraction extended to full length in the no-aptamer control, set 
to 100%) for three independent experiments. "F1Pk" and "F2Pk" 
indicate aptamers that carry sequences that match the consensus 
motif definitions of these two families of pseudoknots." Several 
aptamers meet motif definitions for both pseudoknot families. No 
pseudoknots were evident in the sequences of the five samples on 
the right. NR, no reaction. 
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nt transcripts. In contrast, only 60% of the F2Pk cores (9 of 
15) identified manually at the time of the original selection 
inhibited RT as well as their corresponding full-length tran- 
scripts (Supplementary Figure S3). In some cases, manual 
screens of truncated transcripts for the other 40% identified 
new inhibitory F2Pk that had not been recognized previously 
(Supplementary Figure S2), but even after this analysis 
there remained 19 inhibitory aptamers for which no pseudo- 
knots were evident (Supplementary Figure S2). Two of the 
most inhibitory nonpseudoknot aptamers were 80.103 and 
80.111, which suppressed full-length product formation by 
RTto below the detection limit of this assay (Figure 1). These 
two aptamers were therefore chosen for further study. 

Deletion analysis was used to define the segment within 
aptamer 80.103 that is required for binding RT. Several RNA 
transcripts that began at nucleotide 21 or 25 inhibited RT to 
essentially the same degree as did the full-length aptamer, 
whereas all transcripts that began at nucleotide 31 failed to 
inhibit (Figure 2a). Similarly, transcripts of aptamer 80.103 
that ended at position 84 or 94 inhibited RT, whereas all tran- 
scripts that ended at position 74 failed to inhibit (Figure 2a). 
Thus, the functional core of aptamer 80.103 is fully contained 
between nucleotides 25 and 84. A similar analysis for aptamer 
80.111 established that its functional core is fully contained 
between nucleotides 9 and 68 (Figure 2a). Potential second- 
ary structures consistent with these functional boundaries 
are shown in Figure 2b. 

High-throughput comparative sequence analysis reveals 
a novel conserved UCAA bulge motif 

During the course of the analysis above, we completed a 
separate study of the 70HRT^^ aptamer populations by HTS 
analysis, including development of a bioinformatics pipeline 
specifically adapted for SELEX data sets.^^ HTS reads from 
the 80HRT^^ population were parsed into the "clusters" of 
closely related sequences with edit distance <7 to capture 
essentially all mutational variants that share common ances- 
try with the surviving sequences. The 5,000 most abun- 
dant clusters collectively comprised 613,313 quality-filtered 
sequence reads, and all clusters contained at least two such 
reads. As with the 70HRT,^ population, the HTS data set 
for the 80HRT^^ population is dominated by pseudoknots 
(data not shown). The cluster containing aptamer 80.103 is 
the 27th most abundant cluster in the population, with 768 
unique sequences (2,666 total sequence reads) represent- 
ing 0.43% of the population. These sequences were aligned 
using the software Mafft, and an initial secondary structure of 
the functional core was predicted from the observed patterns 
of conservation and covariation using RNalifold. The stems 
and a single-stranded UCAA bulge are highly conserved 
within the cluster, whereas the sequences in the terminal 
loop, single nucleotide bulges, and sequences near the 5' 
and 3' predicted boundaries are less conserved. 

The cluster containing aptamer 80.111 is much more 
rare (3,311th most abundant cluster), with only six unique 
sequences and eight total sequence reads, representing 
0.001% of the population. The limited sample number and 
diversity precluded meaningful evaluation of intracluster con- 
servation and covariation. 



However, a search for matching character strings within 
the experimentally determined functional boundaries and 
within the original 80N random region identified a stretch 
of 16 nucleotides (GGATCAAATTAATGCT) within the func- 
tional core of 80.1 1 1 that is highly conserved among 16 other 
independent clusters (perfectly conserved in nine), and that 
extends to 23 nucleotides of conservation in 12 (Figure 2c 
and Supplementary Figure 4a). Potential pairing of this ele- 
ment with the 5' primer-binding segment (used for amplifying 
the library during the selection) produces a UCAA bulge that 
is analogous to the conserved feature within the 80.103 clus- 
ter (Figure 2b). The most abundant clusters to contain this 16 
nt conserved element was aptamer #342, so called because 
it is the 342nd most abundant within the 80HRT^^ population 
(123 unique sequences among 199 reads). Its putative sec- 
ondary structure includes the full 23 nucleotide version of the 
conserved element and is fully contained within the first 54 
nucleotides. Transcripts comprising nucleotides 1-134 (full 
length) or nucleotides 1-54 strongly inhibited primer exten- 
sion by RT, whereas truncations comprising nucleotides 
9-54 or 9-44 did not inhibit (Figure 2d), consistent with pair- 
ing between the 5' constant region and the full 23 nucleotide 
version of the conserved element (Figure 2b). 

UCAA aptamer family 

The apparent convergence of aptamers 80.103, 80.111, and 
#342 on a previously unrecognized RT-binding structural 
motif led us to apply a more rigorous informatics analysis in 
four stages. The first stage involved two rounds of curated 
search and refinement. We first built a covariation model (CM) 
based on the sequences of these three aptamers, aligned 
in the CM to preserve the predicted UCAA bulge. The CM 
search identified 28 new sequences that conformed to the 
CM, including two sequences that formed the two stems with 
completely different sequences than those observed in the 
three seed sequences. These two sequences were added to 
the alignment of the three original seed sequences to gen- 
erate a new, refined CM that was again searched against 
the 80HRT^^ population. This second search identified a total 
of 57 sequences that conformed to the refined CM, includ- 
ing four sequences that formed the UCAA structure entirely 
within the 80 nt random region, and were thus not subject to 
the constraints of the constant primer-binding sequences. 

The second phase of the analysis was a semiautomated 
CM search of only the 80N random region of the 80HRT^^ 
population, after first removing the 5' and 3' constant regions 
from the data set. The four sequences that formed the UCAA 
structure without the involvement of the constant regions 
were then used to seed multiple rounds of semiautomated 
CM searching and refinement. This process identified 42 
clusters that formed the UCAA element entirely within the 
random region, including the cluster that contains aptamer 
80.103. The consensus structure based on these 42 clus- 
ters, which is free from sequence constraints imposed by the 
primer-binding regions, is a simplified stem-loop structure in 
which the UCAA bulge is flanked on one side by two highly 
conserved base pairs (AC/GU) and on the other by a largely 
generic helix that is interrupted by a single unpaired U resi- 
due (Figure 2b). 
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gggcauaagguauuuaauuccauaCCTAGGACGAAAGCGATAATCGGGCCTGGAGGATCAAATTAATGCT [63 nts] 

gggcauaagguauuuaauuccauaATGTGGATCAAATTATTGCTCACT^TGCTC [80 nts] 

gggcauaagguauuuaauuccaua — CGAAAGTGAATTAATGCTCACTGTG [82 nts] 

gggcauaagguauuuaauuccauaGCACGGATCAAATTTCTACT [90 nts] 

gggcauaagguauuuaauuccauaCCGTGGATCAAATTGGTGCTCACTGTG [83 nts] 

gggcauaagguauuuaauuccauaGGCGCAATCCAATCGGATCAAATTAATGCTCACTGTG [72 nts] 80HRT^4 

gggcauaagguauuuaauuccauaGAGCACACCTCACTGGAGATCGCTTGTGGATCAAATTGATGCC [67 nts] 

ccauaGACAGAAAGGTCAGGCTGAGTTGGAGAACCTCACCCTAATCGTAAACTGGATCftAATTAATGCT GATTACGCC [ 3 6nt s ] 

445,1,26 [46 nt s ] ...GGATCAAATTAATGCT^^CXfilfiTCCTAATGCACCTGAGGAGCGACCgggcauaagguauuuaai 
70HRT 2890,1,2 [52 nts] ...GGATCAAATTAATGCTii^ciaxaccGAgggcauaagguauuuaauuccaija 

14 4464, 1, 1 [55 nts] ,..GGATCAAATTAATGCT^^X£X£CCAGATGACGAGTCTgggcauaagguauuuaauuccaua 



Figure 2 Functional cores of UCAA family aptamers. (a) Primer extension assays were carried out in the presence of truncated 
aptamer 80.103 (left) and aptamer 80.111 (riglit), bearing the nucleotide segments indicated above the lanes. Gel images show only the 
region near the full-length product (arrow). "0", reaction product formed in the absence of aptamer. *The indicated functional core lies be- 
tween nucleotides 25 and 84 for aptamer 80.103 and between nucleotides 9 and 68 for aptamer 80.1 1 1 . (b) Predicted secondary structures 
of aptamers 80.103(25-84), 80.111(9-73) and #342(1-54). Numbers after the aptamer names indicate the 5' and 3' nucleotides in the 
functional cores. Lower case nucleotides indicate segments derived from the constant primer-binding segments of the library. Numbering 
on the secondary structures in this and subsequent figures indicates nucleotide positions in the aptamers from which they are derived to 
facilitate comparison among truncated variants. Equivalent inhibition results were obtained from 80.103(25-84) when two additional G's 
were appended to the 5' end to increase transcription efficiency (not shown). Structures of aptamers 80.1 1 1 and #342 represent the two 
major pairing pattern observed for sequences that paired with the constant region. Also shown in this panel is the conservation patterns 
among the most abundant sequences within each of the 42 clusters for the UCAA family isolates that are fully contained within the 80N 
random region of the library. Black boxes, >95% conservation within cluster and black dots; Black letters and gray dots, 90-95%, conser- 
vation; Grey letters and open circles, 85-90% conservation; R or Y >85% purine or pyrimidine, respectively, (c) Alignments of 80.1 1 1 with 
other sequences in the yOHRT,^ and 80HRT,4 populations revealed a shared 16 nt sequence that can base pair with part of the constant 
primer-binding region (bold). Underlined segment extends the shared sequence element to approximately 23 nt.The three numbers in the 
sequence identifiers indicate (rank number of the cluster within the population), (rank number of this specific sequence within the clus- 
ter), (number of repeat samplings of that sequence). Thus, sequence "342.1 .49" is the most abundant sequence within cluster #342, and 
this exact sequence was sampled 49 times, (d) Four truncations of #342 were in vitro transcribed to test base pairing hypothesis. These 
truncations were used in primer extension assays, and the percent of primer converted to full-length product were plotted relative to the 
no-aptamer control as in Figure 1 . Full-length aptamer 80.1 1 1 was included for comparison. 



In the third phase of the analysis, the 5' and 3' constant 
regions were re-appended to the sequences in the full 
80HRT^^ population, and the CM built from sequences that 
form the UCAA structural element entirely within the random 
region was then used to search this data set. This search 
identified an additional 1 09 clusters that form the UCAA bulge 
motif secondary structures by pairing with elements in the 5' 



primer-binding segment, most frequently in the pairing com- 
bination AAU'^UCC/GGAUCAAAUU, in which represents 
the site across from the UCAA element in bold (Figure 3b). 
The frequent utilization of a specific pairing register with the 
constant region suggests that the constant region contrib- 
utes modestly to satisfying the required sequence informa- 
tion content of the motif. Among the aptamers that carried the 
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16 nt conserved element that pairs with the 5' constant 
regions in aptamers 80.111 and #342, only one of these 
(#947) was also present in this final set of sequences from 
the CM search based on the conserved sequence features 
that were not constrained by the 5' or 3' constant regions. 
These three sets of hits therefore yield a total of 167 clusters 
within the UCAA family from the 80HRT^^ population (Sup- 
plementary Figure S4a,b). 

Finally, in the fourth phase of the analysis, similar searches 
were applied to several 70HRT populations described previ- 
ously.^^ This process identified 14 additional UCAA struc- 
tures, 11 of which were identified using the consensus 
structural model derived from the SON random region and 
three of which contained the 1 6 nt conserved sequence noted 
above (Supplementary Figure S4c). Although the 70HRT^^ 
aptamers are from an independent selection, their 3' primer- 
binding segment is identical to the 5' primer-binding segment 
of the 80HRT„ population" and can pair with the conserved 
16 nucleotide sequence. The 70HRT2^(j^ population experi- 
enced more stringent selection conditions than the 70HRT^^ 
population. Under the more stringent conditions, the rela- 
tive abundance of sequences that carried the (6/5)AL and 
(6/5)ALcp motifs significantly increased with selection strin- 
gency, whereas the relative abundances of sequences with 
FlPk and F2Pk motifs significantly decreased, and these 
trends are readily explained by their respective RT-binding 
affinities." Applying a similar analysis to the UCAA-motif 
aptamers revealed that their relative abundance increases 
slightly in the 70HRT24q^ population, suggesting that the 
UCAA aptamers are intermediate between pseudoknots and 
(6/5)AL aptamers. 

Experimental confirmation of secondary structures 

Aptamer 80.103(25-84) was analyzed using enzymatic 
probing to determine which portions of the molecule show 
susceptibility to cleavage by endonucleases SI, VI, or T1 
(Figure 3a), and the results were mapped onto the pro- 
posed secondary structure (Figure 3b). Nucleotides 40, 52, 
and 64 are partially cleaved in the absence of endonuclease 
(Figure 3a, lane 1), suggesting that these nucleotides are 
unstructured under native conditions and that they are suscep- 
tible to in-line attack by the 2'-0H. These positions map to a 
single nucleotide bulge, to the terminal loop, and to the UCAA 
bulge, respectively. Nucleotides 47 and 49-53 in the terminal 
loop are susceptible to cleavage by nuclease SI (Figure 3a, 
lane 4), which preferentially cleaves single-stranded regions 
within folded RNA. Likewise, nuclease T1 cleaves after each 
G residue when the RNA is fully denatured (Figure 3a, lane 
2) but only after the limited subset of G's that remain unstruc- 
tured under native conditions (Figure 3a, lane 6). Comparing 
these two lanes identifies G residues that are predominately 
single or double stranded in the native structure. Nucleotides 
38, 45, and 58-59 are susceptible to cleavage by nuclease 
VI (Figure 3a, lane 5), which preferentially cleaves double- 
stranded RNA. These cleavages are again consistent with the 
predicted secondary structure. Aptamer 80.1 1 1 was subjected 
to similar enzymatic nuclease treatments and the cleavage 
patterns were also consistent with the predicted UCAA bulge 
structure (Supplementary Figure S5). 
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Figure 3 Enzymatic probing of aptamer 80.103(25-84). 

(a) Electrophoretic analysis of the products of digestion reactions 
utilizing nucleases T1, VI and SI. NR, no reaction; den, diges- 
tion carried out under denaturing conditions; nat, digestion carried 
out under native conditions; OH, alkaline digestion at elevated pH. 
Note that alkaline and T1 digestions leave 2',3'-cyclic phosphate 
products, whereas SI and VI digestions leave 3'-0H products;^^ 
hence, SI and V1 digestion products are shifted approximately 1 
nucleotide upward relative to alkaline and T1 digestion products. G 
residues marked on the left are numbered according to their po- 
sitions in the full-length aptamer transcripts, (b) Digestion results 
were mapped onto secondary structure of aptamer 80. 1 03(25-84). 
Open wedges, cleavage sites for nuclease VI ; filled wedges, cleav- 
age sites for nuclease SI; arrows pointing inward, cleavage sites 
for nuclease T1 ; arrows pointing outward, G residues that are not 
cleaved in the native structure. Similar analysis for aptamer 80.1 1 1 
is presented in Supplementary Figure S5. 

Mutational analysis reveals simplified sequence 
requirements for the members of the UCAA aptamer 
family 

Results from the comparative sequence analysis and enzy- 
matic probing were used to guide the construction of ter- 
minal and internal mutations that were introduced to refine 
the nucleotide requirements of the UCAA aptamer family. 
Disrupting two of the helical elements of the 60-nt truncated 
aptamer 80.103(25-84) (mutants "A Stem" and "C Stem," 
respectively) abolished RT inhibition, whereas restoring 
base pairing potential ("B Stem" and "D Stem," respectively) 
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partially rescued RT inhibition (Supplementary Figure S6a). 

Additional variants strongly inhibited primer extension by RT 
when the terminal heptanucleotide loop (GAAUAGA) was 
simplified to a stable UUCG tetraloop (80.103UNGG), when 
the unpaired nucleotides on the 5' side of the stem were 
removed (80.103RB1), and when the 3' end was trimmed 
to nucleotide 80 (data not shown). Combining these three 
separate mutations yielded the simplified, 50 nucleotide-long 
RU25-80 variant (Figure 4), which retains strong RT inhibi- 
tion in primer extension assays. 

A similar analysis was carried out for aptamer 80.111 
(Figure 4 and Supplementary Figure S6b). RT was strongly 
inhibited by a mutant in which the 9 nt apical loop sequence 
was changed to a stable UUCG tetraloop ("80.111UNCG"), 
and by a circularly permutated mutant in which the original 



5' and 3' ends were joined through an additional GAAA seg- 
ment and transcription was initiated from an internal posi- 
tion (Figure 4b, right). These observations establish that 
there is no requirement for specific sequence or structural 
features at the helical termini. Disrupting a helical element 
just below the UCAA motif ("16-17 Disrupt") abolished RT 
inhibition, whereas restoring base pairing potential ("16-17 
Restore") rescued inhibition (Supplementary Figure S6b). 
Two mutants that lacked the single unpaired U in the lower 
stem ("BPU" and "Delta U") both inhibited RT slightly less 
than the original 80.1 1 1 . 

To understand the contribution of the UCAA bulge, this 
element was mutated to UCAC for the full-length and trun- 
cated forms of aptamers 80.103 and 80.111. In the context 
of the truncated aptamers, these mutations abolished RT 



80.103 80.111 80.111 

(RU25-80) (UNCG) (CP) AS_1 




80.103 80.111 




Figure 4 Utilization of UCAA bulge and peripheral flanking sequences, (a) Sequences and projected secondary structures of some 
of the aptamer variants analyzed here. Boxed portions indicate segments addressed by individual mutants, (b) Two gels showing results of 
simplifying aptamers 80.103 (left) and 80.1 1 1 (right). The "RU" designation indicates that three internal unpaired nucleotides were removed 
from the corresponding truncations of aptamer 80.103. 80.111 mutants "UNCG" and "CP" are shown in panel A. Open and filled arrows 
indicate positions of unextended primer and full-length product bands, respectively. Results from additional internal mutations are given 
in Supplementary Figure S6. (c) Partial disruption of UCAA element. Primer extension by RT was measured in the presence of several 
variants of aptamers 80.103 and 80.1 1 1 . Black bars indicate parental sequences. Gray bars indicate mutated sequences in which UCAA 
bulges were mutated to UCAC. Error bars indicate the SDs among three independent measurements. NR, no reaction. 
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inhibition (Figure 4c), consistent witli tlie postulated func- 
tional importance of the UCAA element. In contrast, the full- 
length molecules were not sensitive to mutating the UCAA to 
UCAC, potentially indicating that peripheral elements in the 
full-length molecules form additional stabilizing contacts that 
compensate for partial disruption of the UCAA element. 

A final series of transcripts interchanged upper and lower 
helical elements from different aptamers. For this analysis, 
two categories of UCAA aptamers were considered: those 
that form the lower stem by pairing with the primer-binding 
segment using the 16 nucleotide element identified above, 
and those that form their secondary structures fully within 
the random region. Strong inhibition was observed for chi- 
meric aptamer AS1, in which the lower portion represents 
a consensus among aptamers within the first category and 
the upper portion is derived from aptamer 80.111UNCG 
(Figure 4a and data not shown). In contrast, no appreciable 
inhibition was observed when the lower portion was derived 
from 80.1 1 1 and the upper portion was derived from aptam- 
ers from the second category, or when any further trunca- 
tions or modifications were introduced in the distal helical 
element that is closed by the tetraloop (AS2 through AS23, 
Supplementary Figure S7). The 60 to 63 nucleotide aptam- 
ers 80.111UNCG and AS1 therefore represent minimal ver- 
sions of aptamer 80.1 1 1 . 

UCAA aptamer family inhibits RT in the low nanomolar 
range 

To compare quantitatively the functional cores of each aptamer 
with the corresponding full-length transcripts, the concentra- 
tion dependence of the inhibition was determined for both the 
full-length and minimal cores of aptamers 80.103, 80.111, 
and #342 (Supplementary Figure S8 and Figure 5). The 
aptamer concentration required to inhibit 50% of RT activity 
(\C^) was slightly lower for the full-length aptamers than for 
the cores. Aptamer #342 had the lowest value of 1 .6 ± 
0.3 nmol/l for the full-length and 2.9 ± 0.4 nmol/l for aptamer 
#342(1-54). Inhibition by aptamer 80. 103 was similar, with an 




ICj-g value of 2.1 ± 0.5 nmol/l for the full length and 5.8 ± 0.8 
nmol/l for the 50 nt functional core RU25-80. Aptamer 80.1 1 1 
was slightly weaker, with an ICj.„ value of 7.6 ± 1 .0 nmol/l for 
the full length and 11.7 ± 1.0 nmol/l for 80.111(9-70), but 
hybrid aptamer AS1 was as potent as aptamer #342, with 
an IC50 value of 1 .4 ± 0.3 nmol/l. Under these same assay 
conditions, the full-length transcripts for F1 Pk aptamer 70.05 
and for F2Pk aptamer 70.08 had ICj.„ values of 13 ± 3 nmol/l 
and 6.3 ± 1.1 nmol/l, respectively (Figure 5). The aptamers in 
the UCAA structural family are therefore comparable to, if not 
slightly better inhibitors than, these two pseudoknots. 

UCAA aptamers inhibit HIV-1 replication 

Inhibition of HIV-1 replication in single-cycle infectivity assays 
was measured to evaluate antiviral bioactivity of these 
aptamers. We recently demonstrated that aptamers can be 
expressed to high levels from a cassette in which stable RNA 
structures flank the expressed aptamer on both the 5' and 
3' sides. The aptamers become packaged within the nascent 
virus and inhibit viral replication in the subsequent cycle of 




Aptamer 

Figure 5 Concentration dependence of UCAA aptamer inhibi- 
tion. ICgj values and SDs were calculated from triplicate assays 
are shown for each aptamer. 



Aptamer expressed in virus-producing cells 



Figure 6 UCAA aptamers inhibit HIV in single-cycle viral rep- 
lication assays. The effects of intracellular expression of aptam- 
ers on HIV-1 replication were determined by a recently described 
single-cycle infectivity assay.'^ The y-axis represents the relative 
fraction of target cells that became infected with VSV-pseudotyped 
virus, with normalization to the average of the three noninhibitory 
controls (diagonal stripes). "pcDNA," parent vector in which the 
expression cassette was built; "Empty," plasmid carries an empty 
expression cassette with no aptamer; "Arbitrary," plasmid directs 
expression of a 70-nucleotide fragment of the luciferase gene 
flanked by the same primer-binding sequences as those used for 
the 70HRT^^ library. Previously studied reference constructs (white 
bars)'^ were included for comparison with the indicated full-length 
(gray bars) and minimal core or mutant aptamers (black bars). The 
80.111(Dbl) construct cotranscribes two expression cassettes for 
aptamer 80.111. Aptamer 80.1 11 (UCAC) is a mutant in which the 
UCAA of aptamer 80.111(9-70) was replaced with UCAC. Struc- 
tural families of each group of aptamers are indicated schemati- 
cally above the bars: left to right, (6/5)AL, pseudoknots, UCAA. 
Plotted values represent the mean ± SDs for triplicate transfec- 
tions. Pk, pseudoknots. 
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replication^' in a manner that is strongly correlated with RT 
inhibition in vitro.^^ Various full-length and truncated aptamer- 
encoding DMAs and controls were therefore cloned into the 
aptamer expression cassette, and HIV-1 infectivity was deter- 
mined for pseudotyped virus produced from cells expressing 
these aptamers. No viral inhibition was evident for controls 
representing the parental plasmid vector ("pcDNA3.1"), a 
modified expression platform containing all nonaptamer com- 
ponents ("Empty"), and a plasmid carrying an arbitrary 70 nt 
fragment of luciferase mRNA in place of the aptamer ("Arbi- 
trary"). In contrast, infectivity was significantly reduced (P < 
0.001 ) for all aptamers tested in comparison with controls. The 
UCAA aptamers inhibited as well as, or slightly better than, the 
pseudoknot aptamers (Figure 6, compare Pk versus UCAA), 
and inhibition by aptamer 80.103 was greater than that of 
aptamer 80.111. For aptamer 80.111, inhibition increased 
when two copies of the aptamer were expressed within the 
same transcript (Figure 6, 80.111(FL) versus 80.111(Dbl)), 
raising the possibility that polyaptamer constructs may pro- 
vide a general strategy of increasing local aptamer concentra- 
tion and augmenting viral suppression. The UCAA functional 
core of aptamer 80.111 was slightly more inhibitory than 
the full-length aptamer (Figure 6, compare 80.111(FL) with 
80.111(9-70)). Expressing the UCAC mutant of the func- 
tional core provided as much viral inhibition as expressing the 
UCAA functional core, in contrast with the strong difference 
in enzyme inhibition between these two constructs (Figure 
5c). We speculate that the flanking RNA structures within 
the expression cassette may provide additional contacts that 
compensate for partial disruption of the UCAA element. 

Discussion 

This work identifies the UCAA element as a novel, potent, 
nonpseudoknot RNA module that inhibits DNA polymeriza- 
tion by HIV-1 RT and that suppresses HIV-1 replication in 
human cells. In the 20 years since RNA aptamer inhibitors of 
HIV-1 RT were first described,^ only the pseudoknots have 
received serious attention, in part because they strongly 
dominated LTS data sets from the early in vitro selections. ^"'^^ 
The structural simplicity of the F1 Pk and F2Pk motifs gives 
them a numerical advantage over other structural motifs in 
random sequence libraries that results in an over-abundance 
of pseudoknots in the final selected populations, even though 
more complex structural motifs with equivalent or improved 
binding abilities are clearly also present. Aptamers 80.103 
and 80.111 were first identified through manual screening 
of transcripts from approximately 100 aptamer isolates, and 
the core regions responsible for RT inhibition were identified 
through enzymatic probing and additional screening of RT 
inhibition by truncated variants. 

HTS analysis of aptamer population has made it pos- 
sible to describe SELEX populations more completely, as 
demonstrated in several recent studies.^^'^^-^" Comparative 
approaches have proven especially powerful for extracting 
structural inferences based on the divergent cloud of muta- 
tions within individual lineages, identifying convergence on 
specific structural motifs and evaluating the relative fitness 
among and within these structural motifs. In the present 



work, comparative sequence analysis of HTS data proved 
instrumental in generalizing the findings beyond the two 
exemplars identified from manual screening and in defining 
the motif as a whole, including identification of UCAA family 
aptamers in 167 clusters from the 80HRT„ population and 
14 from among 70HRT populations. The most abundant clus- 
ter, which includes aptamer 80.103, represents only 0.43% 
of the total 80HRT,^ HTS data set and could easily have 
been missed in the manual screen. It was even more fortu- 
itous that the exceedingly rare aptamer 80.1 1 1 was sampled 
in the manual screen, given that its cluster was sampled in 
only 8 reads. As with the recently described (6/5)AL motif, 
which was similarly rare in the 70HRT,^ data set (approxi- 
mately 3%)," the large data sets associated with HTS analy- 
sis greatly accelerated identification of the UCAA element 
as a cohesive motif and provided a high-resolution view of 
sequence requirements and subclasses. The (6/5)AL motif^^ 
and the UCAA motif expand the known major classes of RT- 
inhibiting RNAs from one (pseudoknots only) to three, each 
of which include identifiable subclasses. 

The core of the UCAA-motif element is an eight-nucleotide 
signature that includes the unpaired UCAA portion and highly 
conserved CG and AU pairs on the 5' side of this segment 
(Figure 2b). Modeling with the Rosetta web server (http:// 
rosie.rosettacommons.org) indicates a sharply bent structure 
(Figure 7) in which the first three nucleotides of the UCAA 
sequence interact with the each other and with the two con- 
served flanking base pairs to establish the stable bend, and the 
fourth nucleotide stacks on the downstream helix. In the crystal 
structure of RT in complex with dsDNA,^' approximately 1 8 bp 
fill the cleft between the polymerase and RNaseH active sites. 
DNA near the polymerase active site forms a short A-form 
helix that abruptly bends near the base of the "thumb" domain, 
followed by a longer B-form helix that extends to the RNaseH 
active site. It is intriguing to speculate that a UCAA-induced 
bend may be similarly located near the RT thumb domain. 

The subfamilies of UCAA aptamers differ in their helical 
requirements. About 75% of the UCAA clusters from the 
80HRT^^ population (125 of 167) utilize the 5' constant region 
to form the overall secondary structure. The remaining 25%, 
including aptamer 80.103, form the conserved secondary 
structure utilizing only nucleotides derived from the 80N ran- 
dom region. Among the UCAA aptamers and sequence vari- 
ants, all inhibitory constructs contain one long stem and one 
short one, with discontinuities built into the long stem. For 
aptamers 80.103 and #342, the long stem lies "below" the 




Figure 7 Stereo view of UCAA core as modeled by Rosetta. 

A 32 nt segment of 80.103 core RU25-80 (5'-UGUCU ACCGGGC 
UUCG GCCCGGU UCAA GGACA-3') was modeled by the Ro- 
setta web server. Straight lines indicate approximate trajectory of 
helical axis. 
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UCAA element in the structural depictions used here, and 
both aptamers include an asymmetric A/C-rich internal loop. 
For aptamer 80.111, the long stem lies "above" the UCAA 
element, and there is an asymmetric A/GG internal loop near 
the end of the stem. The fact that the long arm is found on 
both sides of the UCAA sequence suggests that the UCAA 
sequence does not dictate orientation of the aptamer with 
respect to RT, and instead serves primarily to establish the 
bend between the two stems. This is in contrast with the heli- 
cal length requirements of aptamers in the (6/5)AL family, for 
which stems 1 and 2, which are also defined relative to the 
internal loop, have different length requirements. The long 
arms of UCAA family aptamers carry irregularities that also 
appear to be important for RT recognition. The long arms 
of aptamers 80.103 and #342 include an asymmetric A/C- 
rich internal loop, whereas the long arm of aptamer 80.111 
includes an asymmetric A/GG internal loop near the end of 
the stem. RT inhibition was disrupted when the A/GG internal 
loop of aptamer 80.111 was converted into two base pairs 
(ASM; Supplementary Figure S7), or when the A/C-rich 
internal loop in the long arm of aptamer #342 inhibition was 
converted into three base pairs (data not shown). Irregularity 
near the end of the "long" helix may position the end of that 
helix for specific contacts, perhaps near the RNaseH domain. 
HTS analysis did not resolve length requirements and details 
of these distortions because of the high variability in their 
composition and placement relative to the overall structure. 

The strong correlation between RT in vitro and antiviral bio- 
activity in cell culture supports the use of anti-RT aptamers as 
tools for exploring viral pathogenesis and as potential thera- 
peutic agents. These results extend and strengthen previously 
observed correlations for pseudoknots^'^ and for the (6/5)AL 
element, and is consistent with a model in which intracel- 
lular and intraviral RT-aptamer interactions are responsible for 
the observed antiviral effects. It is likely that there are many 
additional RNA structural families that bind HIV-1 RT with high 
affinity and that can be accessed through a combination of spe- 
cialized selection methods. Comparative sequence analysis of 
HTS data sets has proven especially adept at accelerating the 
identification and optimization of such new aptamers, espe- 
cially against the high backdrop of the dominant pseudoknot 
F1 Pk and F2Pk motifs. Each new structural family enhances 
the possibilities for developing aptamer antagonists of HIV-1 . 

Materials and methods 

Materials. Synthetic DNA was purchased from Integrated 
DNA Technologies (Coralville, lA). Radiolabeled nucleotides 
for 5' labeling ([y-^^PJATP) were purchased from Perkin-Elmer 
(Waltham, MA). The p51 and p66 subunits of RT from HIV-1 
subtype B (HXB2 strain GenBank accession number K03455) 
were cloned into the protein expression construct pRT-Dual, 
kindly provided by Dr Stefanos G. Sarafianos, expressed in 
Escliericliia co// strain BL21(DE3) and purified essentially as 
described.^" Aptamer RNA was transcribed in vitro from syn- 
thetic DNA oligonucleotides or from polymerase chain reac- 
tion products amplified from plasmids" using phage T7 RNA 
polymerase. Transcripts were gel-purified as described^ and 
resuspended in deionized water. 



RT enzymatic inhibition assays. Primer extension was car- 
ried out essentially as described.^ Briefly, a Cy3-labeled, 
18-mer DNA oligonucleotide corresponding to the 3' end of 
tRNA'->'='^ was mixed with a 31-mer template in a 1:3 ratio to 
ensure that all primer was pre-bound to template. This mixture 
was heated to 90 °C in a heat block for 2 minutes and then 
annealed by cooling to room temperature. A reaction mster 
mix was assembled to contain (final concentrations) 30 \jmo\/\ 
dNTPs, 0.5 mmol/l ethylenediaminetetraacetic acid (EDTA), 
50 mmol/l Tris-HCI pH 7.8, 50 mmol/l NaCI^, 10 mmol/l dithio- 
threitol and 20 nmol/l RT, with the RT and dithiothreitol added 
last. The reaction master mix of 14 pi was aliquoted to each 
tube, along with either 2 |jl of aptamer solution (final concen- 
tration 100 nmol/l unless otherwise noted) or water. Reactions 
were initiated by adding 4 pi of a solution containing annealed 
primer/template and MgCI^ ((final concentration 20 nmol/l and 
6 mmol/l, respectively). After incubating at 37 °C for 10 min, 
reactions were stopped by adding 20 ^jI of 90% formamide, 
50 mmol/l EDTA, and a trace amount of bromophenol blue. 
Samples were heated to 90 °C for 2 min immediately before 
loading onto a 15% polyacrylamide, 8 mol/l urea denaturing 
gel. Gels were scanned for Cy3 fluorescence with a FLA9000 
phosphorimager (Fujifilm, Valhalla, NY). The fraction of primer 
converted to full-length product was determined by quantify- 
ing band intensities using ImageQuant software (Pharmacia, 
Piscataway, NJ) and normalized by setting the fraction con- 
verted to full-length product in the absence of aptamer to 
100%. Aptamer concentrations required for half-maximal inhi- 
bition (ICj;;, values) were calculated as described' by fitting the 
data with GraphPad Prism 6 software to a standard two-state 
sigmoidal dose response curve: Y = 1/(1 -i- 10'^[x - log(ICj.(,)]), 
where Y is the normalized fraction full-length product at a given 
aptamer concentration (x). Enzyme inhibition assays were per- 
formed in triplicate for all reactions from which IC^^ values were 
calculated. 

HTS and analysis pipeline. HTS data for the SOHRT,^ popula- 
tion were obtained and analyzed as described previously for 
70HRT„ and other populations." Flanking sequences required 
for lllumina sequencing and indexing were appended during 
two sequential polymerase chain reaction amplification steps 
using plaque-forming unit DNA polymerase. Final amplified 
products from 80HRT„ were pooled with other populations, 
loaded onto a single lane, bridge amplified and then run through 
100 sequencing cycles. Illumina's analysis software was used 
to generate fastq files with sequence calls and associated qual- 
ity scores for >3 million raw sequence reads per population. 
613,313 quality-filtered reads for the 80HRT,^ populations were 
aligned, clustered, and used to identify converged structures. 

Enzymatic probing. Secondary structures in solution were 
assessed by enzymatic digestion as described. For each 
reaction, 50,000-200,000 cpm of 5' radiolabeled RNA was 
digested under native conditions at 37 °C with ribonuclease 
T1 (0.005 U/pl for 2 min; Ambion; Life Technologies, Grand 
Island, NY), or SI nuclease (4.75 U/pl for 10 min; New Eng- 
land Biolabs, Ipswich, MA), or ribonuclease VI (5 x lO"'' U/|jl 
for 8 min; Ambion). All reactions were quenched with equal vol- 
umes of colorless gel loading buffer (10 mol/l urea, 15 mmol/l 
EDTA) and quickly cooled in a dry ice/ethanol bath. Products 
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of digestions were separated on 8 mol/l urea denaturing 15% 
polyacrylamide gels and analyzed as above. 

Cell lines, plasmlds and viral assays. Plasmids for direct- 
ing aptamer expression were constructed as previously 
described^'' and utilize a human cytomegalovirus (CIVIV) 
immediate early promoter. Proviral plasmid (pNL4-3-Aenv- 
CIVIV-EGFP) was kindly provided by Vineet KewalRamani 
(National Cancer Institute [NCI], Fredrick, MD) and car- 
ries the genome of HIV-1 strain NL4-3, in which the genes 
encoding vlf, vpr, vpu, net, and env have been deleted, and 
a CMV-driven enhanced green fluorescent protein (EGFP) 
reporter gene replaces nef. Cell culture, virus production and 
evaluation of single-cycle viral infectivity were carried out as 
described. The human cell line, 293FT (Invitrogen, Carls- 
bad, CA), was transfected with polyethyleneimine in 6-well 
cell culture dishes. Aptamer-expressing plasmids were trans- 
fected first (1 ^ig), followed four hours later by transfection 
with a mixture of pNL4-3-Aenv-CMV-EGFP (250 ng) and 
pMD-G (125 ng; Invitrogen) to produce pseudotyped HIV-1 
in the presence of aptamer. Medium was changed between 
transfections and again four hours after the second transfec- 
tion. Virus was harvested 48 hours posttransfection by filter- 
ing the medium through 0.45 ^im filters, and was quantified 
by p24 enzyme-linked immunosorbent assay. Fresh 293FT 
cells were infected with 25 ^il of filtrate so that 5-10% of the 
target cells would become infected in the no-aptamer control; 
this level of infection provides sensitive readout of aptamer- 
mediated viral suppression.^^ Cells were collected 48 hours 
post-infection, fixed with 4% paraformaldehyde, and ana- 
lyzed for EGFP fluorescence on an Accuri Flow Cytometer 
(BD Biosciences, San Jose, CA).The percentage of infected 
(EGFP positive) cells was normalized to p24 levels in each 
sample, and the average of the control samples was set to 1 . 
One-way analysis of variance and Student's f test were used 
to determine statistical significance between samples. 

Supplementary material 

Figure SI. Complete sequences of isolates from aptamer 
populations 70HRT^^ (a) and 80HRT^^ (b), as determined by 
low-throughput sequencing (LTS) of plasmids that were shot- 
gun cloned from the library. 

Figure S2. Initial screen and prioritization of 99 aptamers for 

inhibition of RTfrom HIV-1 strain HXB2. 

Figure S3. Results from screen of proposed F1 Pk and F2Pk 

cores. 

Figure S4. Aligned sequences of the UCAA family 
members. 

Figure S5. Enzymatic probing of aptamer 80.1 11 (9-73). 
Figure S6. Effects of internal mutations of 80.103 and 
80.111 on RT inhibition. 

Figure S7. Effects of helical swaps among subfamilies of 
UCAA aptamers. 

Figure S8. Aptamer concentration dependence of RT 
inhibition used in calculating IC^o values. 
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